81 research outputs found

    Survey of extrachromosomal circular DNA derived from plant satellite repeats

    Background: Satellite repeats represent one of the most dynamic components of higher plant genomes, undergoing rapid evolutionary changes of their nucleotide sequences and abundance in a genome. However, the exact molecular mechanisms driving these changes and their eventual regulation are mostly unknown. It has been proposed that amplification and homogenization of satellite DNA could be facilitated by extrachromosomal circular DNA (eccDNA) molecules originated by recombination-based excision from satellite repeat arrays. While the models including eccDNA are attractive for their potential to explain rapid turnover of satellite DNA, the existence of satellite repeat-derived eccDNA has not yet been systematically studied in a wider range of plant genomes.
    Results: We performed a survey of eccDNA corresponding to nine different families and three subfamilies of satellite repeats in ten species from various genera of higher plants (Arabidopsis, Oryza, Pisum, Secale, Triticum and Vicia). The repeats selected for this study differed in their monomer length, abundance, and chromosomal localization in individual species. Using two-dimensional agarose gel electrophoresis followed by Southern blotting, eccDNA molecules corresponding to all examined satellites were detected. EccDNA occurred in the form of nicked circles ranging from hundreds to over eight thousand nucleotides in size. Within this range the circular molecules occurred preferentially in discrete size intervals corresponding to multiples of monomer or higher-order repeat lengths.
    Conclusion: This work demonstrated that satellite repeat-derived eccDNA is common in plant genomes and thus it can be seriously considered as a potential intermediate in processes driving satellite repeat evolution. The observed size distribution of circular molecules suggests that they are most likely generated by molecular mechanisms based on homologous recombination requiring long stretches of sequence similarity.

    Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

    Background: Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum).
    Results: Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula.
    Conclusion: We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data provide a starting point for further investigations of legume plant genomes based on their global comparative analysis and for the development of more sophisticated approaches for data mining.
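The coverage figure quoted in this abstract can be sanity-checked with a short calculation. The sketch below is not taken from the paper. It assumes that the 33.3 Mb of sequence data and the 0.77% coverage describe the same sequencing run, and that coverage means sequenced bases divided by haploid (1C) genome size. The function name `implied_genome_size_mb` is hypothetical.

```python
def implied_genome_size_mb(sequenced_mb: float, coverage_fraction: float) -> float:
    """Return the genome size (in Mb) implied by a read volume and a fractional coverage.

    Assumes coverage = sequenced bases / genome size, which is an
    interpretation of the abstract, not a definition stated in it.
    """
    if coverage_fraction <= 0:
        raise ValueError("coverage_fraction must be positive")
    return sequenced_mb / coverage_fraction


SEQUENCED_MB = 33.3   # sequence data analysed, as quoted in the abstract
COVERAGE = 0.0077     # 0.77% coverage, as quoted in the abstract

size_mb = implied_genome_size_mb(SEQUENCED_MB, COVERAGE)
print(f"Implied 1C genome size: about {size_mb:.0f} Mb ({size_mb / 1000:.1f} Gb)")
```

Under these assumptions the two quoted numbers imply a genome of roughly 4.3 Gb, which gives a sense of how little of the genome was sampled to reconstruct the major repeat families.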

    Next Generation Sequencing-Based Analysis of Repetitive DNA in the Model Dioecious Plant Silene latifolia

    BACKGROUND: Silene latifolia is a dioecious plant with well distinguished X and Y chromosomes that is used as a model to study sex determination and sex chromosome evolution in plants. However, efficient utilization of this species has been hampered by the lack of large-scale sequencing resources and detailed analysis of its genome composition, especially with respect to repetitive DNA, which makes up the majority of the genome. METHODOLOGY/PRINCIPAL FINDINGS: We performed low-pass 454 sequencing followed by similarity-based clustering of 454 reads in order to identify and characterize sequences of all major groups of S. latifolia repeats. Illumina sequencing data from male and female genomes were also generated and employed to quantify the genomic proportions of individual repeat families. The majority of identified repeats belonged to LTR-retrotransposons, constituting about 50% of genomic DNA, with Ty3/gypsy elements being more frequent than Ty1/copia. While there were differences between the male and female genome in the abundance of several repeat families, their overall repeat composition was highly similar. Specific localization patterns on sex chromosomes were found for several satellite repeats using in situ hybridization with probes based on k-mer frequency analysis of Illumina sequencing data. CONCLUSIONS/SIGNIFICANCE: This study provides comprehensive information about the sequence composition and abundance of repeats representing over 60% of the S. latifolia genome. The results revealed generally low divergence in repeat composition between the sex chromosomes, which is consistent with their relatively recent origin. In addition, the study generated various data resources that are available for future exploration of the S. latifolia genome.

    Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

    This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
    Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families.
    This work was supported by the Natural Environment Research Council (grant no. NE/G017 24/1), the Czech Science Foundation (grant no. P501/12/G090), the AVCR (grant no. RVO:60077344) and a Beatriu de Pinos postdoctoral fellowship to J.P. (grant no. 2011-A-00292; Catalan Government-E.U. 7th F.P.).

    Stretching the Rules: Monocentric Chromosomes with Multiple Centromere Domains

    The centromere is a functional chromosome domain that is essential for faithful chromosome segregation during cell division and that can be reliably identified by the presence of the centromere-specific histone H3 variant CenH3. In monocentric chromosomes, the centromere is characterized by a single CenH3-containing region within a morphologically distinct primary constriction. This region usually spans up to a few Mbp composed mainly of centromere-specific satellite DNA common to all chromosomes of a given species. In holocentric chromosomes, there is no primary constriction; the centromere is composed of many CenH3 loci distributed along the entire length of a chromosome. Using correlative fluorescence light microscopy and high-resolution electron microscopy, we show that pea (Pisum sativum) chromosomes exhibit remarkably long primary constrictions that contain 3-5 explicit CenH3-containing regions, a novelty in centromere organization. In addition, we estimate that the size of the chromosome segment delimited by two outermost domains varies between 69 Mbp and 107 Mbp, several factors larger than any known centromere length. These domains are almost entirely composed of repetitive DNA sequences belonging to 13 distinct families of satellite DNA and one family of centromeric retrotransposons, all of which are unevenly distributed among pea chromosomes. We present the centromeres of Pisum as novel "meta-polycentric" functional domains. Our results demonstrate that the organization and DNA composition of functional centromere domains can be far more complex than previously thought, do not require single repetitive elements, and do not require single centromere domains in order to segregate properly. Based on these findings, we propose Pisum as a useful model for investigation of centromere architecture and the still poorly understood role of repetitive DNA in centromere evolution, determination, and function.

    The ecology of palm genomes: repeat-associated genome size expansion is constrained by aridity

    Genome size varies 2400-fold across plants, influencing their evolution through changes in cell size and cell division rates which impact plants' environmental stress tolerance. Repetitive element expansion explains much genome size diversity, and the processes structuring repeat "communities" are analogous to those structuring ecological communities. However, which environmental stressors influence repeat community dynamics has not yet been examined from an ecological perspective. We measured genome size and leveraged climatic data for 91% of genera within the ecologically diverse palm family (Arecaceae). We then generated genomic repeat profiles for 141 palm species, and analysed repeats using phylogenetically informed linear models to explore relationships between repeat dynamics and environmental factors. We show that palm genome size and repeat "community" composition are best explained by aridity. Specifically, Ty3-gypsy and TIR elements were more abundant in palm species from wetter environments, which generally had larger genomes, suggesting amplification. By contrast, Ty1-copia and LINE elements were more abundant in drier environments. Our results suggest that water stress inhibits repeat expansion through selection on upper genome size limits. However, elements that may associate with stress-response genes (e.g. Ty1-copia) have amplified in arid-adapted palm species. Overall, we provide novel evidence of climate influencing the assembly of repeat "communities".
    JP was supported by a Ramón y Cajal Fellowship (RYC-2017-2274) funded by MCIN/AEI/10.13039/501100011033 and by ‘ESF Investing in your future’. SB was funded by a Garfield Weston Foundation postdoctoral fellowship. PN and JM were supported by the ELIXIR CZ Research Infrastructure Project (Czech Ministry of Education, Youth and Sports; grant no. LM2018131).

    In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae

    The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.

    Experimental evidence for splicing of intron-containing transcripts of plant LTR retrotransposon Ogre

    Ogre elements are a distinct group of plant Ty3/gypsy-like retrotransposons characterized by several specific features, one of which is a separation of the gag-pol region into two non-overlapping open reading frames: ORF2 coding for Gag-Pro, and ORF3 coding for RT/RH-INT proteins. Previous characterization of Ogre elements from several plant species revealed that part of their transcripts lacks the region between ORF2 and ORF3, carrying one uninterrupted ORF instead. In this work, we investigated the hypothesis that this region represents an intron that is spliced out from part of the Ogre transcripts as a means for preferential production of ORF2-encoded proteins over those encoded by the complete ORF2–ORF3 region. The experiments involved analysis of transcription patterns of well-defined Ogre populations in the model plant Medicago truncatula and examination of transcripts carrying a dissected pea Ogre intron expressed within the coding sequence of a chimeric reporter gene. Both experimental approaches proved that the region between ORF2 and ORF3 is spliced from Ogre transcripts and showed that this process is only partial, probably due to weak splice signals. This is one of very few known cases of spliced LTR retrotransposons and the only one where splicing does not involve parts of the element’s coding sequences, thus resembling intron splicing found in most cellular genes.

    The giant diploid faba genome unlocks variation in a global protein crop

    Publisher Copyright: © 2023, The Author(s).
    Increasing the proportion of locally produced plant protein in currently meat-rich diets could substantially reduce greenhouse gas emissions and loss of biodiversity [1]. However, plant protein production is hampered by the lack of a cool-season legume equivalent to soybean in agronomic value [2]. Faba bean (Vicia faba L.) has a high yield potential and is well suited for cultivation in temperate regions, but genomic resources are scarce. Here, we report a high-quality chromosome-scale assembly of the faba bean genome and show that it has expanded to a massive 13 Gb in size through an imbalance between the rates of amplification and elimination of retrotransposons and satellite repeats. Genes and recombination events are evenly dispersed across chromosomes and the gene space is remarkably compact considering the genome size, although with substantial copy number variation driven by tandem duplication. Demonstrating practical application of the genome sequence, we develop a targeted genotyping assay and use high-resolution genome-wide association analysis to dissect the genetic basis of seed size and hilum colour. The resources presented constitute a genomics-based breeding platform for faba bean, enabling breeders and geneticists to accelerate the improvement of sustainable protein production across the Mediterranean, subtropical and northern temperate agroecological zones.

    Repeat Composition of CenH3-chromatin and H3K9me2-marked heterochromatin in Sugar Beet (Beta vulgaris)

    Kowar T, Zakrzewski F, Macas J, et al. Repeat Composition of CenH3-chromatin and H3K9me2-marked heterochromatin in Sugar Beet (Beta vulgaris). BMC Plant Biology. 2016;16(1): 120.
    Background: Sugar beet (Beta vulgaris) is an important crop of temperate climate zones, which provides nearly 30 % of the world’s annual sugar needs. From the total genome size of 758 Mb, only 567 Mb were incorporated in the recently published genome sequence, due to the fact that regions with high repetitive DNA contents (e.g. satellite DNAs) are only partially included. Therefore, to fill these gaps and to gain information about the repeat composition of centromeres and heterochromatic regions, we performed chromatin immunoprecipitation followed by sequencing (ChIP-Seq) using antibodies against the centromere-specific histone H3 variant of sugar beet (CenH3) and the heterochromatic mark of dimethylated lysine 9 of histone H3 (H3K9me2).
    Results: ChIP-Seq analysis revealed that active centromeres containing CenH3 consist of the satellite pBV and the Ty3-gypsy retrotransposon Beetle7, while heterochromatin marked by H3K9me2 exhibits heterogeneity in repeat composition. H3K9me2 was mainly associated with the satellite family pEV, the Ty1-copia retrotransposon family Cotzilla and the DNA transposon superfamily of the En/Spm type. In members of the section Beta within the genus Beta, immunostaining using the CenH3 antibody was successful, indicating that orthologous CenH3 proteins are present in closely related species within this section.
    Conclusions: The identification of repetitive genome portions by ChIP-Seq experiments complemented the sugar beet reference sequence by providing insights into the repeat composition of poorly characterized CenH3-chromatin and H3K9me2-heterochromatin. Therefore, our work provides the basis for future research and application concerning the sugar beet centromere and repeat rich heterochromatic regions characterized by the presence of H3K9me2.